Non-record: hybrid spiking Transformer (SNN)with a multi-step spiking MLP by tsbiosky · Pull Request #664 · openai/parameter-golf

tsbiosky · 2026-03-25T00:53:40Z

Hybrid Spiking Neural Networks (SNNs) MLP

val_bpb: 1.2982 | 15.78 MB | 8×H100 SXM

A contest-friendly hybrid SNN submission built from the train_gpt.py baseline: keep dense GQA attention and the original training/eval/compression pipeline, but replace the standard feed-forward block with a small multi-step leaky integrate-and-fire (LIF-style) spiking MLP.

Reference :https://arxiv.org/pdf/2203.14679

Why this is interesting

This is not a fully spiking language model. It is a hybrid Transformer + SNN-MLP design:

embeddings, attention, residual path, and logits remain standard dense LM components
only the feed-forward block is replaced by a spiking mechanism
the original Parameter Golf training and export path stays intact

That makes the experiment meaningful for the contest setting because it isolates one question:

Can spike neural network achieves good performance in a tiny language model under a strict size budget?

non-record submission

d8a3cfb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Non-record: hybrid spiking Transformer (SNN)with a multi-step spiking MLP#664

Non-record: hybrid spiking Transformer (SNN)with a multi-step spiking MLP#664
tsbiosky wants to merge 1 commit intoopenai:mainfrom
tsbiosky:main

tsbiosky commented Mar 25, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

tsbiosky commented Mar 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Hybrid Spiking Neural Networks (SNNs) MLP

Why this is interesting

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

tsbiosky commented Mar 25, 2026 •

edited

Loading